[fix](gpt-oss): fix quark quantized model in moe bias by PerryZhang01 · Pull Request #787 · ROCm/ATOM

PerryZhang01 · 2026-05-14T12:27:06Z

Motivation

This PR fixes a padding error in the quantized gpt-oss model. The quantized gpt-oss-120b checkpoint comes from the Quark team (https://huggingface.co/amd/gpt-oss-120b-moe-ori-attn-ptpc); it quantizes only the GEMM weights in attention, using the PTPC method. The MoE biases are padded, and allocating them as empty tensors leaves uninitialized (dirty) data in the padded region, so this PR uses zero-initialized bias tensors instead.
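The difference between an empty and a zero-initialized padded bias can be sketched as follows. This is a minimal illustration, not the code changed in this PR: the helper name `make_padded_bias` and the shapes (4 experts, 6 real outputs padded to 8) are hypothetical, and it assumes the bias is padded along its last dimension.

```python
import torch


def make_padded_bias(real_bias: torch.Tensor, padded_size: int) -> torch.Tensor:
    """Return a bias padded along the last dim, with an explicitly zero tail.

    Hypothetical helper for illustration; not part of ATOM.
    """
    n = real_bias.shape[-1]
    if padded_size < n:
        raise ValueError("padded_size must be >= the real bias length")
    # torch.zeros guarantees the padded tail is 0. torch.empty would only
    # reserve memory, so the tail could hold arbitrary leftover values that
    # then get added to the MoE output.
    padded = torch.zeros(
        (*real_bias.shape[:-1], padded_size),
        dtype=real_bias.dtype,
        device=real_bias.device,
    )
    # Copy the real bias values into the leading slots.
    padded[..., :n] = real_bias
    return padded


# Hypothetical shapes: 4 experts, 6 real outputs padded to 8.
real = torch.arange(24, dtype=torch.float32).reshape(4, 6)
bias = make_padded_bias(real, 8)
```

Because only the first `n` entries are ever overwritten with real values, the initial contents of the tail are exactly what the padded region contributes, which is why the allocation must be zeroed.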

Co-authored-by: perzhang <perzhang@amd.com>

[fix](gpt-oss): fix quark quantized model in moe bias

3c0f267

valarLip approved these changes May 14, 2026


valarLip merged commit aa7c25a into main May 18, 2026
67 of 85 checks passed

valarLip deleted the quant_gpt_oss branch May 18, 2026 09:05

sijyang pushed a commit that referenced this pull request May 24, 2026

[fix](gpt-oss): fix quark quantized model in moe bias (#787)

e4d97a3

Co-authored-by: perzhang <perzhang@amd.com>


valarLip merged 1 commit into main from quant_gpt_oss


4 participants
